Data Confidentiality

نویسنده

  • Jerome P. Reiter
چکیده

When releasing data to the public, data disseminators typically are required to protect the confidentiality of survey respondents’ identities and attribute values. Removing direct identifiers such as names and addresses generally is not sufficient to eliminate disclosure risks, so that data must be altered before release to limit the risks of unintended disclosures. When intense data alteration is needed to ensure protection, the quality of the released data can be seriously degraded. This article reviews a disclosure limitation approach called synthetic data, in which values of confidential data are replaced with simulations from statistical models. Theoretical and empirical investigations have shown that synthetic data approaches have the potential to result in higher data quality than other disclosure limitation procedures, particularly when intense data alteration is necessary. The article discusses the main variants of synthetic data approaches, namely full synthesis and partial synthesis. It includes discussions of synthetic data generation and disclosure risk assessment. Many national statistical agencies, survey organizations, and researchers disseminate microdata, i.e., data on individual units, to the public. Wide access to microdata facilitates advances in science and public policy, encourages replication of findings, enables students to train on genuine datasets, and helps citizens to stay informed about their society. Often, however, data disseminators cannot release microdata in their collected form, because doing so would reveal some respondents’ identities or values of sensitive attributes. Data disseminators therefore typically alter the collected data before release. Common strategies include recoding variables, such as releasing ages in five year intervals or geographies at high levels of aggregation; reporting exact values only above or below certain thresholds, for example reporting all ages above 90 as “90 or more;” swapping data values for selected records, e.g., exchange two or more individuals’ demographic data (Dalenius and Reiss, 1982); and, adding noise to numerical data

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Security, confidentiality, and privacy of information in the field of health with data EPR embedding in medical MRI images based on HVS model

the development of new technology and modern equipment has led to the development of telemedicine systems. As a result, there are dangers such as publishing patient information and intentionally or unintentionally, medical information. The forensic organization, as one of the powerful arms of the judiciary, pursues important cases in the medical and psychiatric commissions to take steps to rea...

متن کامل

مقایسه‌ سطوح دسترسی و محرمانگی مدارک پزشکی در کشورهای منتخب و ایران

Introduction: Undoubtedly, the medical record is one of the most important documents containing the most sensitive information on the public health and treatment. As a matter of fact, protecting the confidentiality of the recorded information and the documents there in should be given top priority. Thus, given the importance of the confidentiality of medical document, and their impact on the be...

متن کامل

A Method for Protecting Access Pattern in Outsourced Data

Protecting the information access pattern, which means preventing the disclosure of data and structural details of databases, is very important in working with data, especially in the cases of outsourced databases and databases with Internet access. The protection of the information access pattern indicates that mere data confidentiality is not sufficient and the privacy of queries and accesses...

متن کامل

Study of Healthcare Service Recipients' Perceptions Regarding Observance of Patient Privacy and Medical Confidentiality in Teaching Healthcare Centers Affiliated with the Qom University of Medical Sciences in 2015-2016, Iran

Background and Objectives: Medical confidentiality and maintenance of patient personal privacy are considered two important moral obligations in medical ethics with a long history in medicine. To be efficient, a healthcare system needs active participation of and appropriate cooperation between the recipients and providers of healthcare services. This study was conducted to investigate healthca...

متن کامل

Important Issues on Statistical Confidentiality Methods

This paper sets out, in the context of official statistics, some of the key issues of confidentiality and the methods developed to maintain confidentiality. The relevance of the issues and methods to data mining of official data are discussed. Recent developments that will increase the availability of microdata for scientific research are outlined.

متن کامل

Confidentiality and Integrity in Distributed Data Exchange

Confidentiality and Integrity in Distributed Data Exchange

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011